Data Preview

Tail

index  num_households  num_total_bedrooms  num_bedrooms_per_room  num_population  num_longitude  num_total_rooms  num_median_income  num_housing_median_age  num_latitude  median_house_value
16507  0.173803        0.126209            -0.995393              0.423061        0.774716       0.623958         1.063654           -0.208681               -0.857561     237400.000000
16508  0.655510        0.606421            -0.394476              0.294826        -1.128449      0.749567         0.278522           0.425777                0.442331      221300.000000
16509  -0.720345       -0.767680           -0.781626              -0.694107       0.589894       -0.592189        0.913161           0.663699                -0.768719     336000.000000
16510  0.025828        -0.093647           -0.591079              0.403500        0.929567       0.107052         -0.334358          0.425777                -0.731312     128800.000000
16511  2.572891        2.669018            0.854992               1.622822        -0.913656      1.719127         -0.674998          -0.843140               1.377505      114500.000000

Pre Processing

Pre Processing - Imputations

Pre Processing - Imputations - Missing

No missing values in data

Pre Processing - Imputations - Infinities

No Infinity values
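
The two checks above (missing values and infinite values) can be reproduced with a short sketch. This is a minimal illustration in plain Python, not the code that generated this report; the function name `count_missing_and_infinite` and the toy columns are hypothetical.

```python
import math


def count_missing_and_infinite(columns):
    """Return {column: (n_missing, n_infinite)} for a dict of numeric lists.

    None and NaN are counted as missing; +inf and -inf are counted as infinite.
    """
    report = {}
    for name, values in columns.items():
        n_missing = sum(1 for v in values if v is None or math.isnan(v))
        n_infinite = sum(1 for v in values if v is not None and math.isinf(v))
        report[name] = (n_missing, n_infinite)
    return report


# Toy columns for illustration only (not taken from the report's data).
toy = {
    "clean": [0.1, -0.4, 2.5],
    "dirty": [1.0, None, float("nan"), float("inf")],
}
result = count_missing_and_infinite(toy)
print(result)
```

A dataset like the one in this report would return `(0, 0)` for every column.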

Health Analysis

Health Plot

Missing Plot

Missing Value Summary

No Missing Values

Duplicate Columns

No duplicate variables

Outliers In Features

Data Shape: (16512, 10)
feature                < (mean-3*std)  > (mean+3*std)  < (1stQ - 1.5 * IQR)  > (3rdQ + 1.5 * IQR)  -inf  +inf
num_households         0               456             0                     966                   0     0
num_total_bedrooms     0               447             0                     1025                  0     0
num_bedrooms_per_room  0               225             0                     500                   0     0
num_population         0               430             0                     948                   0     0
num_total_rooms        0               456             0                     1014                  0     0
num_median_income      0               316             0                     545                   0     0
median_house_value     0               0               0                     854                   0     0
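
The column headers above name two outlier rules: values more than 3 standard deviations from the mean, and values beyond 1.5 * IQR outside the first or third quartile. The sketch below shows one way to count them for a single feature. It is an assumption-laden illustration: the report does not say whether it uses the sample or population standard deviation, or which quantile method, so the choices here (sample standard deviation, "inclusive" quartiles) and the name `outlier_counts` are hypothetical.

```python
import statistics


def outlier_counts(values):
    """Count values flagged by the 3-sigma rule and by the 1.5 * IQR rule."""
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample std; the report's choice is unknown
    # Quartiles with linear interpolation over the observed range (assumed method).
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return {
        "below_3std": sum(v < mean - 3 * std for v in values),
        "above_3std": sum(v > mean + 3 * std for v in values),
        "below_iqr": sum(v < q1 - 1.5 * iqr for v in values),
        "above_iqr": sum(v > q3 + 1.5 * iqr for v in values),
    }


# Toy feature: the integers 1..20 plus one extreme value (not from the report).
toy_feature = list(range(1, 21)) + [100]
counts = outlier_counts(toy_feature)
print(counts)
```

The IQR fences are usually tighter than the 3-sigma fences on skewed data, which matches the table: every feature has more values above the upper IQR fence than above mean + 3 * std.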

Feature Analysis

Summary Stats

Summary Stats - Numeric Variables

index  Variable Name  Datatype  No of Unique  Samples  Mean  Standard Deviation  Min  25th percentile  Median  75th percentile  Max
0  median_house_value  float64  3677  [162500.00000001, 239800.00000001, 102900.00000001001, 194400.00000001, 333600.00000001]  206892.639838  115401.991073  14999.000000  119300.000000  180150.000000  264925.000000  500001.000000
1  num_bedrooms_per_room  float64  15117  [-0.3224372233041336, -0.24501017670722167, -0.3879437658595288, -0.28794539285247206, -0.17902274801845858]  -0.020040  0.896956  -1.975985  -0.635616  -0.179023  0.447645  3.000091
2  num_households  float64  1346  [-1.022591917416956, 0.0856480012494535, -0.5314401352807063, 3.0000908471496897, -0.9470301047806098]  -0.013872  0.953174  -1.532634  -0.657376  -0.241787  0.369005  3.000091
3  num_housing_median_age  float64  52  [1.8533096397104474, 1.2981582355983967, -1.8741355021847503, 1.6153876093767114, -1.3189840980726997]  0.000000  1.000030  -2.191365  -0.843140  0.029241  0.663699  1.853310
4  num_latitude  float64  839  [-0.712608981395422, 1.0081114364846906, 1.265284325026557, -0.9697818699372852, -0.7453400763007503]  -0.000000  1.000030  -1.451397  -0.801451  -0.647147  0.970704  2.948598
5  num_longitude  float64  819  [0.6947930751562609, -1.318265927647774, -1.2033767041626782, 0.9245715221264453, 0.5998845861903149]  -0.000000  1.000030  -2.382240  -1.108468  0.534947  0.784706  2.632924
6  num_median_income  float64  10641  [0.22597178277100904, 0.4011776819481103, -0.7787351974323108, 0.5750201110927051, 0.6017213992292936]  -0.004596  0.985547  -1.895644  -0.723856  -0.172275  0.515482  3.000091
7  num_population  float64  3218  [-0.9853538149194075, -0.22246224922423102, -0.5984743314614548, 3.000090847149703, -1.01252231797123]  -0.015575  0.946453  -1.506989  -0.656072  -0.243110  0.361117  3.000091
8  num_total_bedrooms  float64  1436  [-1.1321780181114893, 0.05099491464253844, -0.5333594433435583, 3.0000908471496843, -0.2498607944196104]  -0.014746  0.949832  -1.511140  -0.654859  -0.249861  0.348958  3.000091
9  num_total_rooms  float64  5028  [-1.0870985060910747, 0.09489652957015172, -0.45095132480229516, 3.000090847149681, -1.0737851437893078]  -0.017734  0.938173  -1.476659  -0.639654  -0.248936  0.342641  3.000091
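
The standard deviation of 1.000030 shown for num_housing_median_age, num_latitude and num_longitude is what one would expect if those features were z-scored with the population standard deviation (dividing by n) and then summarized with the sample standard deviation (dividing by n - 1): the ratio of the two is sqrt(n / (n - 1)), with n = 16512 rows per the Data Shape line. This is an inference from the numbers, not something the report states. The sketch below checks the arithmetic on toy values; `standardize` is a hypothetical helper.

```python
import math
import statistics


def standardize(values):
    """Z-score using the population standard deviation (divide by n)."""
    mean = statistics.fmean(values)
    pstd = statistics.pstdev(values)
    return [(v - mean) / pstd for v in values]


toy = [1.0, 2.0, 3.0, 4.0, 5.0]         # toy values, not from the report
scaled = standardize(toy)
sample_std = statistics.stdev(scaled)    # sample std divides by n - 1
expected = math.sqrt(len(toy) / (len(toy) - 1))

n_rows = 16512                           # row count from the Data Shape line
report_ratio = math.sqrt(n_rows / (n_rows - 1))
print(f"{report_ratio:.6f}")             # prints 1.000030
```

The remaining features report means slightly below zero and standard deviations below 1, so some further step appears to have been applied to them; the report does not say which.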

Summary Stats - Non Numeric Variables

No categorical columns

Distributions

Distributions - Numeric Variables

Distributions - Numeric Variables - Median House Value

Distributions - Numeric Variables - Num Bedrooms Per Room

Distributions - Numeric Variables - Num Households

Distributions - Numeric Variables - Num Housing Median Age

Distributions - Numeric Variables - Num Latitude

Distributions - Numeric Variables - Num Longitude

Distributions - Numeric Variables - Num Median Income

Distributions - Numeric Variables - Num Population

Distributions - Numeric Variables - Num Total Bedrooms

Distributions - Numeric Variables - Num Total Rooms

Distributions - Non Numeric Variables

No categorical variables in data.

Feature Normality

Feature Interactions

Correlation Table

index  Variable 1              Variable 2              Corr Coef  Abs Corr Coef
0      num_households          num_total_bedrooms      0.974076   0.974076
1      num_latitude            num_longitude           -0.924518  0.924518
2      num_total_bedrooms      num_total_rooms         0.916483   0.916483
3      num_households          num_total_rooms         0.913881   0.913881
4      num_households          num_population          0.908932   0.908932
5      num_population          num_total_bedrooms      0.874524   0.874524
6      num_population          num_total_rooms         0.843648   0.843648
7      median_house_value      num_median_income       0.693776   0.693776
8      num_bedrooms_per_room   num_median_income       -0.677507  0.677507
9      num_housing_median_age  num_total_rooms         -0.387211  0.387211
10     num_housing_median_age  num_total_bedrooms      -0.334554  0.334554
11     num_housing_median_age  num_population          -0.318111  0.318111
12     num_households          num_housing_median_age  -0.315666  0.315666
13     median_house_value      num_bedrooms_per_room   -0.281947  0.281947
14     num_median_income       num_total_rooms         0.239352   0.239352
15     num_bedrooms_per_room   num_total_rooms         -0.212832  0.212832
16     median_house_value      num_total_rooms         0.158967   0.158967
17     median_house_value      num_latitude            -0.148357  0.148357
18     num_bedrooms_per_room   num_housing_median_age  0.136583   0.136583
19     num_housing_median_age  num_median_income       -0.134340  0.134340
20     num_latitude            num_population          -0.125824  0.125824
21     num_bedrooms_per_room   num_total_bedrooms      0.124559   0.124559
22     num_bedrooms_per_room   num_latitude            -0.114660  0.114660
23     num_housing_median_age  num_longitude           -0.112100  0.112100
24     num_longitude           num_population          0.111507   0.111507
25     median_house_value      num_housing_median_age  0.107959   0.107959
26     num_bedrooms_per_room   num_households          0.095830   0.095830
27     num_bedrooms_per_room   num_longitude           0.093409   0.093409
28     num_latitude            num_median_income       -0.087794  0.087794
29     num_households          num_latitude            -0.078968  0.078968
30     median_house_value      num_households          0.070829   0.070829
31     num_latitude            num_total_bedrooms      -0.067996  0.067996
32     num_bedrooms_per_room   num_population          0.067048   0.067048
33     num_longitude           num_total_bedrooms      0.066009   0.066009
34     num_households          num_longitude           0.057956   0.057956
35     median_house_value      num_total_bedrooms      0.054087   0.054087
36     median_house_value      num_longitude           -0.041582  0.041582
37     num_longitude           num_total_rooms         0.035246   0.035246
38     median_house_value      num_population          -0.032884  0.032884
39     num_latitude            num_total_rooms         -0.031465  0.031465
40     num_median_income       num_total_bedrooms      -0.013883  0.013883
41     num_households          num_median_income       0.013682   0.013682
42     num_housing_median_age  num_latitude            0.013563   0.013563
43     num_longitude           num_median_income       -0.010789  0.010789
44     num_median_income       num_population          0.003234   0.003234
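
A table of this shape (one row per unordered pair of variables, sorted by absolute coefficient) can be built as sketched below. The report does not name the coefficient; Pearson's r is assumed here, and the helper names `pearson` and `correlation_table` are hypothetical.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def correlation_table(columns):
    """Return (var1, var2, corr, abs_corr) rows, strongest pairs first."""
    rows = []
    # Sorting the names makes var1 precede var2 alphabetically, as in the table.
    for a, b in combinations(sorted(columns), 2):
        r = pearson(columns[a], columns[b])
        rows.append((a, b, r, abs(r)))
    rows.sort(key=lambda row: row[3], reverse=True)
    return rows


# Toy columns for illustration only (not taken from the report's data).
toy = {"x": [1, 2, 3, 4], "y": [2, 4, 6, 8], "z": [4, 3, 2, 0]}
rows = correlation_table(toy)
for row in rows:
    print(row)
```

With the 10 columns of this dataset, the same loop yields 45 pairs, which matches rows 0 to 44 of the table.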

Correlation Heatmap

Covariance Heatmap

Bivariate Plots (top 50 Correlations)

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Total Bedrooms

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Population

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Population

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Households Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Households Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Longitude

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Median Income

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Population Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Latitude

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Latitude Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Bedrooms Per Room Vs Median House Value

Bivariate Plots (top 50 Correlations) - Num Housing Median Age Vs Num Households

Bivariate Plots (top 50 Correlations) - Num Population Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Total Bedrooms Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Total Rooms Vs Num Housing Median Age

Bivariate Plots (top 50 Correlations) - Num Median Income Vs Num Bedrooms Per Room

Bivariate Plots (top 50 Correlations) - Num Longitude Vs Num Latitude

Key Drivers

Median House Value

Median House Value - Feature Scores - Feature Correlation

Median House Value - Feature Importances - From Model

Median House Value - PCA Analysis

Median House Value - PCA Analysis - PCA Projection

Median House Value - PCA Analysis - Correlation With Dimension 2 (y)

Median House Value - PCA Analysis - Correlation With Dimension 1 (x)

Median House Value - Bivariate Plots

Median House Value - Bivariate Plots - Num Households

Median House Value - Bivariate Plots - Num Total Bedrooms

Median House Value - Bivariate Plots - Num Bedrooms Per Room

Median House Value - Bivariate Plots - Num Population

Median House Value - Bivariate Plots - Num Longitude

Median House Value - Bivariate Plots - Num Total Rooms

Median House Value - Bivariate Plots - Num Median Income

Median House Value - Bivariate Plots - Num Housing Median Age

Median House Value - Bivariate Plots - Num Latitude